智能论文笔记

Predição da Idade Cerebral a partir de Imagens de Ressonância Magnética utilizando Redes Neurais Convolucionais

Victor H. R. Oliveira , Augusto Antunes , Alexandre S. Soares , Arthur D. Reys , Robson Z. Júnior , Saulo D. S. Pedro , Danilo Silva

分类：计算机视觉

2021-12-23

在这项工作中，研究了来自磁共振图像的脑年龄预测的深度学习技术，旨在帮助鉴定天然老化过程的生物标志物。生物标志物的鉴定可用于检测早期神经变性过程，以及预测与年龄相关或与非年龄相关的认知下降。在这项工作中实施并比较了两种技术：应用于体积图像的3D卷积神经网络和应用于从轴向平面的切片的2D卷积神经网络，随后融合各个预测。通过2D模型获得的最佳结果，其达到了3.83年的平均绝对误差。 - Neste Trabalho S \〜AO InvestigaDAS T \'Ecnicas de Aprendizado Profundo Para a previ \ c {c} \〜ate daade脑电站a partir de imagens de resson \ ^ ancia magn \'etica，Visando辅助Na Identifica \ c {C} \〜AO de BioMarcadores Do Processo Natural de Envelhecimento。一个identifica \ c {c} \〜ao de bioMarcarcores \'e \'util para a detec \ c {c} \〜ao de um processo neurodegenerativo em Est \'Agio无数，Al \'em de possibilitar Prever Um decl 'inio cognitivo relacionado ou n \〜ao \`一个懒惰。 Duas T \'ECICAS S \〜AO ImportyAdas E Comparadas Teste Trabalho：Uma Rede神经卷应3D APLICADA NA IMAGEM VOLUM \'ETRICA E UME REDE神经卷轴2D APLICADA A FATIAS DO PANIAS轴向，COM后面fus \〜AO DAS PREDI \ C {c} \ \ oes个人。 o Melhor ResultAdo Foi optido Pelo Modelo 2D，Que Alcan \ C {C} OU UM ERRO M \'EDIO ABSOLUTO DE 3.83 ANOS。

translated by 谷歌翻译

Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection

Vinícius Camargo da Silva , João Paulo Papa , Kelton Augusto Pontara da Costa

分类：自然语言处理 | 机器学习

2022-12-21

Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually turn out in models difficult to interpret. Given the challenge behind interpretable learning-based text summarization and the importance it may have for evolving the current state of the ATS field, this work studies the application of two modern Generalized Additive Models with interactions, namely Explainable Boosting Machine and GAMI-Net, to the extractive summarization problem based on linguistic features and binary classification.

translated by 谷歌翻译

Clinical Deterioration Prediction in Brazilian Hospitals Based on Artificial Neural Networks and Tree Decision Models

Hamed Yazdanpanah , Augusto C. M. Silva , Murilo Guedes , Hugo M. P. Morales , Leandro dos S. Coelho , Fernando G. Moro

分类：机器学习

2022-12-17

Early recognition of clinical deterioration (CD) has vital importance in patients' survival from exacerbation or death. Electronic health records (EHRs) data have been widely employed in Early Warning Scores (EWS) to measure CD risk in hospitalized patients. Recently, EHRs data have been utilized in Machine Learning (ML) models to predict mortality and CD. The ML models have shown superior performance in CD prediction compared to EWS. Since EHRs data are structured and tabular, conventional ML models are generally applied to them, and less effort is put into evaluating the artificial neural network's performance on EHRs data. Thus, in this article, an extremely boosted neural network (XBNet) is used to predict CD, and its performance is compared to eXtreme Gradient Boosting (XGBoost) and random forest (RF) models. For this purpose, 103,105 samples from thirteen Brazilian hospitals are used to generate the models. Moreover, the principal component analysis (PCA) is employed to verify whether it can improve the adopted models' performance. The performance of ML models and Modified Early Warning Score (MEWS), an EWS candidate, are evaluated in CD prediction regarding the accuracy, precision, recall, F1-score, and geometric mean (G-mean) metrics in a 10-fold cross-validation approach. According to the experiments, the XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.

translated by 谷歌翻译

Privacy-Preserving Data Synthetisation for Secure Information Sharing

Tânia Carvalho , Nuno Moniz , Pedro Faria , Luís Antunes , Nitesh Chawla

分类：机器学习

2022-12-01

We can protect user data privacy via many approaches, such as statistical transformation or generative models. However, each of them has critical drawbacks. On the one hand, creating a transformed data set using conventional techniques is highly time-consuming. On the other hand, in addition to long training phases, recent deep learning-based solutions require significant computational resources. In this paper, we propose PrivateSMOTE, a technique designed for competitive effectiveness in protecting cases at maximum risk of re-identification while requiring much less time and computational resources. It works by synthetic data generation via interpolation to obfuscate high-risk cases while minimizing data utility loss of the original data. Compared to multiple conventional and state-of-the-art privacy-preservation methods on 20 data sets, PrivateSMOTE demonstrates competitive results in re-identification risk. Also, it presents similar or higher predictive performance than the baselines, including generative adversarial networks and variational autoencoders, reducing their energy consumption and time requirements by a minimum factor of 9 and 12, respectively.

translated by 谷歌翻译

1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

Benjamin Kiefer , Matej Kristan , Janez Perš , Lojze Žust , Fabio Poiesi , Fabio Augusto de Alcantara Andrade , Alexandre Bernardino , Matthew Dawkins , Jenni Raitoharju , Yitong Quan

分类：计算机视觉 | 人工智能 | 机器学习 | 机器人

2022-11-24

The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detection. The subchallenges were based on the SeaDronesSee and MODS benchmarks. This report summarizes the main findings of the individual subchallenges and introduces a new benchmark, called SeaDronesSee Object Detection v2, which extends the previous benchmark by including more classes and footage. We provide statistical and qualitative analyses, and assess trends in the best-performing methodologies of over 130 submissions. The methods are summarized in the appendix. The datasets, evaluation code and the leaderboard are publicly available at https://seadronessee.cs.uni-tuebingen.de/macvi.

translated by 谷歌翻译

ASAP: Adaptive Transmission Scheme for Online Processing of Event-based Algorithms

Raul Tapia , José Ramiro Martínez-de Dios , Augusto Gómez Eguíluz , Anibal Ollero

分类：机器人

2022-09-18

在复杂，非结构化和动态环境中导航的董事会机器人基于在线事件的感知技术可能会遭受进入事件速率及其处理时间的不可预测的变化，这可能会导致计算溢出或响应能力损失。本文提出了尽快的：一种新型的事件处理框架，该框架将事件传输到处理算法，保持系统响应能力并防止溢出。尽快由两种自适应机制组成。第一个通过丢弃传入事件的自适应百分比来防止事件处理溢出。第二种机制动态调整事件软件包的大小，以减少事件生成和处理之间的延迟。ASAP保证了收敛性，并且对处理算法具有灵活性。它已在具有挑战性的条件下在船上进行了验证。

translated by 谷歌翻译

ASAP: Adaptive Scheme for Asynchronous Processing of Event-based Vision Algorithms

Raul Tapia , Augusto Gómez Eguíluz , José Ramiro Martínez-de Dios , Anibal Ollero

分类：计算机视觉 | 机器人

2022-09-18

事件摄像机可以通过非常高的时间分辨率和动态范围来捕获像素级照明变化。由于对照明条件和运动模糊的稳健性，他们获得了越来越多的研究兴趣。文献中存在两种主要方法，用于喂养基于事件的处理算法：在事件软件包中包装触发的事件并将它们逐一发送作为单个事件。这些方法因处理溢出或缺乏响应性而受到限制。当算法无法实时处理所有事件时，处理溢出是由高事件产生速率引起的。相反，当事件包的频率太低时，事件包的生成率低时，缺乏响应率会发生。本文提出了尽快的自适应方案，该方案是通过可容纳事件软件包处理时间的可变大小软件包来管理事件流的。实验结果表明，ASAP能够以响应性和有效的方式喂食异步事件聚类算法，同时又可以防止溢出。

translated by 谷歌翻译

Recovering the Graph Underlying Networked Dynamical Systems under Partial Observability: A Deep Learning Approach

Sérgio Machado , Anirudh Sridhar , Paulo Gil , Jorge Henriques , José M. F. Moura , Augusto Santos

分类：机器学习

2022-08-08

我们研究了图结构识别的问题，即在时间序列之间恢复依赖图的图。我们将这些时间序列数据建模为线性随机网络动力学系统状态的组成部分。我们假设部分可观察性，其中仅观察到一个包含网络的节点子集的状态演变。我们设计了一个从观察到的时间序列计算的新功能向量，并证明这些特征是线性可分离的，即存在一个超平面，该超平面将与连接的节点成对相关的特征群体与与断开对相关的节点相关联。这使得可以训练各种分类器进行因果推理的功能。特别是，我们使用这些功能来训练卷积神经网络（CNN）。由此产生的因果推理机制优于最先进的W.R.T.样品复杂性。受过训练的CNN概括了结构上不同的网络（密集或稀疏）和噪声级别的轮廓。值得注意的是，他们在通过合成网络（随机图的实现）训练时也很好地概括了现实世界网络。最后，提出的方法始终以成对的方式重建图，也就是说，通过确定每对相应的时间序列中的每对节点中是否存在边缘或箭头或不存在箭头。这符合大规模系统的框架，在该系统中，网络中所有节点的观察或处理都令人难以置信。

translated by 谷歌翻译

No Pattern, No Recognition: a Survey about Reproducibility and Distortion Issues of Text Clustering and Topic Modeling

Marília Costa Rosendo Silva , Felipe Alves Siqueira , João Pedro Mantovani Tarrega , João Vitor Pataca Beinotti , Augusto Sousa Nunes , Miguel de Mattos Gardini , Vinícius Adolfo Pereira da Silva , Nádia Félix Felipe da Silva , André Carlos Ponce de Leon Ferreira de Carvalho

分类：机器学习 | 自然语言处理 | (统计)机器学习

2022-08-02

使用机器学习算法从未标记的文本中提取知识可能很复杂。文档分类和信息检索是两个应用程序，可以从无监督的学习（例如文本聚类和主题建模）中受益，包括探索性数据分析。但是，无监督的学习范式提出了可重复性问题。初始化可能会导致可变性，具体取决于机器学习算法。此外，关于群集几何形状，扭曲可能会产生误导。在原因中，异常值和异常的存在可能是决定因素。尽管初始化和异常问题与文本群集和主题建模相关，但作者并未找到对它们的深入分析。这项调查提供了这些亚地区的系统文献综述（2011-2022），并提出了共同的术语，因为类似的程序具有不同的术语。作者描述了研究机会，趋势和开放问题。附录总结了与审查的作品直接或间接相关的文本矢量化，分解和聚类算法的理论背景。

translated by 谷歌翻译

Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis

Paolo Braca , Leonardo M. Millefiori , Augusto Aubry , Stefano Marano , Antonio De Maio , Peter Willett

分类： (统计)机器学习 | 人工智能 | 机器学习

2022-07-22

我们研究了机器学习（ML）分类技术的误差概率收敛到零的速率的性能。利用大偏差理论，我们为ML分类器提供了数学条件，以表现出误差概率，这些误差概率呈指数级消失，例如$ \ sim \ exp \ left（-n \，i + o（i + o（n）\ right）$，其中$ n $是可用于测试的信息的数量（或其他相关参数，例如图像中目标的大小），而$ i $是错误率。这样的条件取决于数据驱动的决策功能的累积生成功能的Fenchel-Legendre变换（D3F，即，在做出最终二进制决策之前的阈值）在训练阶段中学到的。因此，D3F以及相关的错误率$ $ $取决于给定的训练集，该集合假定有限。有趣的是，可以根据基础统计模型的可用信息生成的可用数据集或合成数据集对这些条件进行验证和测试。换句话说，分类误差概率收敛到零，其速率可以在可用于培训的数据集的一部分上计算。与大偏差理论一致，我们还可以以足够大的$ n $为高斯分布的归一化D3F统计量来确定收敛性。利用此属性设置所需的渐近错误警报概率，从经验上来说，即使对于$ n $的非常现实的值，该属性也是准确的。此外，提供了近似错误概率曲线$ \ sim \ sim \ sim \ sim \ exp \ left（-n \，i \ right）$，这要归功于精制的渐近导数（通常称为精确的渐近学），其中$ \ zeta_n $代表$ \ zeta_n $误差概率的大多数代表性亚指数项。

translated by 谷歌翻译